Inferring RNA Stem-Loop Descriptors from Multiple Sequence-structure Alignments for an Indexed-based RNA Search Method
Authors
Abstract
Since the discovery of the variety of functional roles performed by non-protein-coding RNA (ncRNA), the search for homologous RNAs has been a problem of great interest in bioinformatics. This thesis presents a new technique that uses an index-based search tool to perform RNA homology search. A multiple sequence alignment of an RNA family is used to generate search patterns (descriptors) for an affix-array-based search tool, which uses the descriptors to search a nucleotide sequence database for new members of the given RNA family. The search results are evaluated and compared with those of other well-known tools that perform a similar task using different techniques. In addition, the thesis introduces two extensions developed for the index-based search tool.
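As a rough illustration of what generating a stem-loop descriptor from an alignment can mean, the sketch below takes a gapped alignment plus a consensus structure in dot-bracket notation and emits one IUPAC consensus pattern per hairpin. It is not the thesis's algorithm and it does not produce the input format of any particular affix-array tool. The function names, the dictionary layout of a descriptor, and the choice to collapse each column to an IUPAC code are all assumptions made for the example.

```python
from typing import Dict, List, Tuple

# IUPAC ambiguity codes for RNA, keyed by the sorted set of observed residues.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "U": "U",
    "AG": "R", "CU": "Y", "CG": "S", "AU": "W", "GU": "K", "AC": "M",
    "CGU": "B", "AGU": "D", "ACU": "H", "ACG": "V", "ACGU": "N",
}


def pair_table(structure: str) -> Dict[int, int]:
    """Map each paired position to its partner in a dot-bracket string."""
    stack: List[int] = []
    pairs: Dict[int, int] = {}
    for i, ch in enumerate(structure):
        if ch == "(":
            stack.append(i)
        elif ch == ")":
            j = stack.pop()
            pairs[i], pairs[j] = j, i
    return pairs


def stem_loops(structure: str) -> List[Tuple[int, int, int, int]]:
    """Return (stem_start, loop_open, loop_close, stem_end) for each hairpin."""
    pairs = pair_table(structure)
    found = []
    for i, j in pairs.items():
        # A hairpin is closed by a pair whose interior is entirely unpaired.
        if i < j and all(structure[k] == "." for k in range(i + 1, j)):
            a, b = i, j
            # Extend outward while the enclosing positions pair with each other.
            while a > 0 and pairs.get(a - 1) == b + 1:
                a, b = a - 1, b + 1
            found.append((a, i, j, b))
    return sorted(found)


def column_consensus(alignment: List[str], col: int) -> str:
    """Collapse one alignment column to an IUPAC code (gaps are ignored)."""
    residues = {seq[col].upper() for seq in alignment} & set("ACGU")
    # Assumption for this sketch: a gap-only column matches any nucleotide.
    return IUPAC["".join(sorted(residues))] if residues else "N"


def stem_loop_descriptors(alignment: List[str], structure: str) -> List[Dict[str, str]]:
    """Build one hypothetical descriptor (left stem, loop, right stem) per hairpin."""
    descriptors = []
    for a, i, j, b in stem_loops(structure):
        cons = "".join(column_consensus(alignment, c) for c in range(a, b + 1))
        descriptors.append({
            "left_stem": cons[: i - a + 1],
            "loop": cons[i - a + 1 : j - a],
            "right_stem": cons[j - a :],
        })
    return descriptors


if __name__ == "__main__":
    family = ["GGCAUUAGCC", "GGCACUAGCC", "GACAUUAGUC"]
    print(stem_loop_descriptors(family, "(((....)))"))
```

A real descriptor would typically also carry constraints such as permitted stem and loop length ranges and allowed mismatches; those are left out here to keep the sketch short.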
Related resources
SupeRNAlign: a new tool for flexible superposition of homologous RNA structures and inference of accurate structure-based sequence alignments
RNA has been found to play an ever-increasing role in a variety of biological processes. The function of most non-coding RNA molecules depends on their structure. Comparing and classifying macromolecular 3D structures is of crucial importance for structure-based function inference and it is used in the characterization of functional motifs and in structure prediction by comparative modeling. Ho...
SimulFold: Simultaneously Inferring RNA Structures Including Pseudoknots, Alignments, and Trees Using a Bayesian MCMC Framework
Computational methods for predicting evolutionarily conserved rather than thermodynamic RNA structures have recently attracted increased interest. These methods are indispensable not only for elucidating the regulatory roles of known RNA transcripts, but also for predicting RNA genes. It has been notoriously difficult to devise them to make the best use of the available data and to predict high...
Stem Stem Stem Loop Loop Loop LoopLoop Loop Loop Loop Loop Loop
Background: Pairwise stochastic context-free grammars (Pair SCFGs) are powerful tools for evolutionary analysis of RNA, including simultaneous RNA sequence alignment and secondary structure prediction, but the associated algorithms are intensive in both CPU and memory usage. The same problem is faced by other RNA alignment-and-folding algorithms based on Sankoff's 1985 algorithm. It is therefor...
Relation Between RNA Sequences, Structures, and Shapes via Variation Networks
Background: RNA plays a key role in many aspects of biological processes and its tertiary structure is critical for its biological function. RNA secondary structure represents various significant portions of RNA tertiary structure. Since the biological function of RNA is concluded indirectly from its primary structure, it would be important to analyze the relations between the RNA sequences and t...
MSARI: multiple sequence alignments for statistical detection of RNA secondary structure.
We present a highly accurate method for identifying genes with conserved RNA secondary structure by searching multiple sequence alignments of a large set of candidate orthologs for correlated arrangements of reverse-complementary regions. This approach is growing increasingly feasible as the genomes of ever more organisms are sequenced. A program called msari implements this method and is signi...